2. The method of claim 1, wherein relevance includes importance. 



3. A method of focused crawling, comprising: 

/ accessing a query input including at least a first query part and a second query part; 

crawling a plurality \f documents, at least some of the plurality of documents including 
links to each other, the crawling at least partly guided by a crawl metric, the crawl metric at least 
partly determined by a first mechanism and by the first query part; and 

returning target documents, the target documents being relevant to the second query part, 
the target documents found from the plurality of crawled documents, the target documents 
returned at least partly based on a^search metric, the search metric at least partly determined by a 
second mechanism and by the second query part. 

4. The method of claim 3, wherein relevance includes importance. 

5. A method of focused crawling\comprising: 
accessing a query input; 

crawling a plurality of documents), the documents including links to each other, and the 
crawling at least partly guided by a crawl metric, the crawl metric at least partly determined by a 
first mechanism, the first mechani sm includ ing a first combination, the first combination 
including a first pluralSy^oTone^rmore procedures, the first plurality of one or more procedures 
including one or more of: 1) evaluating relevance of documents using logical expressions of 
keywords and phrases, 2) evaluating relevance of documents using a template including a 
plurality of one or more template portions, at least one of the template portions including a first 
plurality of one or more hierarchical levels, 3) evaluating relevance of documents using a link 
structure of the crawled documents, and 4) evaluating relevance based on freshness of 
documents; and 

returning target documents, the target documents being relevant to the query input, the 
target documents found from the plurality of crawled documents, the target documents returned 
at least partly based on a search metric, the search metric^at least partly determined by a second 
mechanism, the second mechanism including a second combination, the second combination 
being different from the first combination, the second combination including a second plurality 
of one or more procedures, the second plurality of procedures including one or more of: 1 ) 
evaluating relevance of documents using logical expressions of keywords and phrases, 2) 
evaluating relevance of documents using a template including k plurality of one or more template 
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portions, at least one of the template portions including a second plurality of one or more 
hierarchical levels, 3) evaluating relevance of documents using a link structure of the crawled 
documents, and 4) evaluating relevance based on freshness of documents. 

6. The method of claim 5, wherein relevance includes importance. 

7. The method of claim 5, wherein at least one of the first mechanism and the second 
mechanism includes: 

associating a weight to each c^f the evaluated relevances of the procedures; and 

combining the evaluated relevances and the weights of the evaluated relevances. 

8. The method of claim 5, whereimone or more of: 1) the first plurality of one or more 
hierarchical levels and 2) the second plurality of one or more hierarchical levels, includes at least 
one or more heading levels and one or more content levels. 

9. The method of claim 5, wherein evaluating relevance includes evaluating relevance of at 
least a first document and one or more of a first plurality of one or more referring documents and 
a second plurality of one or more referring documents, each of the first plurality of one or more 
referring documents referring to the first document directly, and each of the second plurality of 
referring documents referring to the first document indirectly through one or more documents. 

10. A method of focused crawling, comprising:\ 
accessing a query input; 

crawling a plurality of documents, the documents including links to each other, and the 
crawling at least partly guided by a crawl metric, the crawl metric at least partly determined by a 
first mechanism, the first mechanism including a first combination, the first combination 
including a first plurality of one or more procedures, the first plurality of one or more procedures 
including one or more of: 1) evaluating relevance of documents using logical expressions of 
keywords and phrases, 2) evaluating relevance of documents\using a template including a 
plurality of one or more template portions, at least one of the template portions including a first 
plurality of one or more hierarchical levels, 3) evaluating relevance of documents using a link 
structure of the crawled documents, and 4) evaluating relevance pased on freshness of 
documents; and 



Attorney Docket No.: 247 1 7-706 

C :\NrPortbl\PALI B 1 \DH 1 \ 1 408686_ 1 . DOC 



3 



returning targekdocuments, the target documents being relevant to the query input, the 
target documents found\from the plurality of crawled documents, the target documents returned 
at least partly based on a^search metric, the search metric at least partly determined by a second 
mechanism, the second mechanism including a second combination, the second combination 
being different from the first combination, the second combination including a second plurality 
of one or more procedures, the second plurality of procedures including one or more of: 1) 
evaluating relevance of documents using logical expressions of keywords and phrases, 2) 
evaluating relevance of documents using a template including a plurality of one or more template 
portions, at least one of the template portions including a second plurality of one or more 
hierarchical levels, 3) evaluating^relevance of documents using a link structure of the crawled 
documents, and 4) evaluating relevance based on freshness of documents, 

wherein the procedure, of the first plurality of one or more procedures, of evaluating 
relevance of documents using a link^tructure of the crawled documents, includes: 

accessing a first plurality of documents from a database of a plurality of received 
documents, the plurality of received documents including crawled documents, the first plurality 
of documents to be ranked; 

generating a graph of the\first plurality of documents; 

assigning weights to one or more nodes of the graph; 

finding an assignment of weights to one or more nodes of the graph, by 
propagating weights through the graph, the\assignment of weight to a node based at least in part 
on calculating a weighted sum of weights propagated from neighboring nodes; and 

generating a ranked list of at least the first plurality of documents, the ranked list 
at least partly generated from the graph. 

1 1 . The method of claim 1 0, wherein relevance includes importance. 

12. The method of claim 10, wherein at least one of the first mechanism and the second 
mechanism includes: 

associating a weight to each of the evaluated relevanc es c^ f. the procedures; and 

combining the evaluated relevances and the weights of the evaluated relevances. 
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13. The method of claim 10, wherein one or more of: 1) the first plurality of one or more 

hierarchical levels and 2) the second plurality of one or more hierarchical levels, includes at least 
\ * 

one or more heading levels and one or more content levels. 



\ 



14. The method of claim 10, wherein evaluating relevance includes evaluating relevance of 
at least a first document and one or more of a first plurality of one or more referring documents 
and a second plurality of one or more referring documents, each of the first plurality of one or 
more referring documents referring to the first document directly, and each of the second 
plurality of referring documents referring to the first document indirectly through one or more 
documents. 

15. The method of claim 10^ wherein the procedure, of the first plurality of one or more 
procedures, of evaluating relevance of documents using a link structure of the crawled 
documents, further comprises: 

expanding the graph with a second plurality of one or more documents from the database, 
wherein a third plurality includes a union of the first plurality of documents and the second 
plurality of documents, and the third olurality of documents is smaller than the plurality of 
received documents. 

16. The method of claim 10, wherein the procedure, of the first plurality of one or more 
procedures, of evaluating relevance of documents using a link structure of the crawled 
documents, further comprises: 

expanding the graph with a second plurality of one or more documents from the database, 
such that a third plurality includes a union of the first plurality of documents and the second 
plurality of documents, and the third plurality of documents is smaller than the plurality of 
received documents, the second plurality including, one or more of: 1) one or more documents 
connected within a first specified number of links in\a forward direction from one or more 
documents of the first plurality of documents, the forward direction being forward from the first 
plurality of documents, and 2) one or more documents connected within a second specified 
number of links in a backward direction from one or more documents of the first plurality of 
documents, the backward direction being backward from the first plurality of documents. 

17. The method of claim 10, wherein the procedure, of the first plurality of one or more 
procedures, of evaluating relevance of documents using a link^tructure of the crawled 
documents, further comprises: 
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